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Description 

Fteld of the invention 

5 [0001 ] The present Invention relates generally to the field of automated image analysis and In particuiarto the problem 
of recDgnlzIng and Identifying human faces contained therein. 

Background of the Invention 

10 [0002] Recognition of faces from still Images or video data. Including verification and identification thereof, has a 
wide range of commercial applications Including video indexing of large databases, security access and numerous 
other multimedia applications. The face recognition problem can be described generally as follows: Given a still image 
or a video sequence as input, the required task Is to identify one or more persons in the input ImagB(s) based, for 
example, on the contents of a previously stored database of faces. A high quaflty face recognition system should be 

15 able to match an input or test image to the stored database even when there are variations in lighting, baclcground, 
facial expression, position, scale, or even when there are changes In hairstyles between the Input Image and the training 
lmage(6) upon which the database Is based. 

[0003] The face Identification problem in particular requires that the system perfomi a one-to-many mapping, deter- 
mining whether the person In the Input Image matches the data for any of the individuals contained in the stored 

20 database, and, if so, which one. The face authentication problem, on the other hand, requires that the system perform 
a one-to-one mapping, detemnining whether the input image (adequately) matches the data for one particular individual 
In the database - typically, the data for the Individual whose Identity is claimed by the person using the system. Face 
authentication has applications in thie security field, while face identification has applications in security as well as In 
fields such as newspaper reporting where it may be desirable to retrieve Images of particular Individuals from large 

?^ muithnedia databases. 

[0004] A typical face recognition system consists of three parts - a preprocessing step to segment the data and 
extract critical areas or features* a feature selection step, and a classification step. The preprocessing step may, for 
example, consist of a rough segmentation of the Input image to isolate the face data from the background data, or it 
may comprise a more detailed segmentation In order to locate facial parts such as the eyes, nose and mouth which 

30 rney then be used to generate the features. The preprocessing step may also extract feature vectors such as the 
geometrical relationships between the facial parts, in any event, once the preprocessing Is done, the area-of-interest, 
i.e., the face, has typically been extracted from the rest of the Image and normalized (in size and position). Since such 
preprocessing Is conventional and well known to those of ordinary skill in the art, the remainder of the discussion herein 
will focus on the feature extraction and classification portions of the face recognition process. 

35 [0005] Several techniques for face recognition have been uaedand/orproposedwhich are b ascd on extracting critk»l 
facial parts or detemnlning a geometric relationship between particular facial paite, and using these as the identifying 
features to be matched against a database. Unfortunately, these methods typteaily depend on the ability to locate the 
facial parts with very high accuracy. This can become very difficult If the images are not captured In a very controlled 
environment. Moreover, most of these techniques rely on the accurate detection of edges In the Images, a local oper- 

iO ation which Is not particularly robust to changes in lighting or to poor quality images with a significant amount of noise 
therein. 

[0006] Techniques which attempt to measure geometric relationships between facial parts such as, for example, the 
distance between the eyes, the width of the mouth, the distance from an eye to the nose, or the distance from an eye 
to the mouth, and then attempt to use these parameters as the features against which the data is matched between 
'iS the Input {Le.. test) image data and the training data {I.e., the data stored in the database) include, for example, R. 
Brunelll and T. Poggio, "Face Recognition through Geometrical Features," Proc. Europe. Conf. on Computer Vision, 
pp. 792-800, 1992. 

[0007] Techniques which, on the other hand, do not depend on locating feu^ial parts for the feature selection process 
Include, for example, L. Sirovich and M. Klrby, "Low-Dimensional Procedure for the Characterization of Human Face," 

5a J. opt. Soc. Amer., vol4, pp. 519-524, 19B7, which uses the well Icnown Karhunen-Loeve transform for face recognition, 
and M. A. Turk and A. P. Pentland, "Face Recognition Using Elgenfaces," Proc. int. Conf. on Pan. Recog., pp. 586- 
591. 1991 , which defines and uses "eigenfaces" for face detection and recognition. In each of these approaches, each 
person's face is essentially represented as a vector of weights applied to a predefined set of "eigenfaces." One weak- 
ness of this particular approach Is that if the input Image is not similar to the images which were collected to compute 

.S5 the "elgenfaces" In temns of scale and head position, the recognition rates drop dramatically. 

[0008] Finally, neural networks have also been used for face recognition. The neural network approach, however, 
typically requires a great deal of training data (e.g., several hundred Images for each person In the database), thus 
making such an approa^ Impractical for a large database. Each of the above described prior art approaches as well 
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as others are described in R. Chellappa, C.L. WHson and S. Sliohey. "Human and Machine Recognltton of Faces: A 
Survey," Proceedings o1 the IEEE» May 1995. 

Summary of the fnventton 

[0009] A method according to the Invention of matching a given object comprised In an Input image signal to one of 
a plurality of known objects, a representation of each of the known objects having been stored In a database and 
comprising a codebook corresponding thereto, each codebook comprising a plurality of codevectors having been gen- 
erated based on one or more conrespondlng training image signals comprising the con-esponding known object, conv 
10 prises the steps of: decomposing a portion of the Input image signal which Includes the given object Into a plurality of 
blocks; generating a plurality of Input vectors con-espondlngto the plurality of blocks, each input vector based on the 
block corresponding thereto; and matching the given object to one of the known objects based on one or more of the 
input vectors and on one or more of the codevectors comprised In one or more of tha codabooks corresponding to the 
known objects. 

r;f [001 0] The step of matching the given object to the one of the known objects may comprise: performing a plurality 
of encodings of the at least a portion of the Input Image signal, each encoding comprising coding one or more of the 
input vectors basedon one or more of the codevectors comprised in a corresponding one of thecod^ooks; determining 
a plurality of coding enrora conrespondlng to the plurality of encodings; and matching the given object to the one of the 
known objects based on the plurality of coding enx>rs. The coding errors comprise mean squared errors. The method 
may further comprise the step of comparing the plurality of coding errors to detemnlne a smallest coding error, and the 
step of matching the given object to the one of the known objects may comprise selecting the known ot)iect which 
corresponds to the codebook which produced die smallest coding error when the at least a portion of the Input Image 
signal was encoded therewith. 

[001 1 ] The step of matching the given object to the one of the known objects may comprise: generating a codebook 
corresponding to the given object based on one or more of the Input vectors; comparing the codebook corresponding 
to the given object with one or more of the codebooks corresponding to the known objects; and matching the given 
object to the one of the known objects based on the comparison of the codebook corresponding to the gh^en object 
with the codebooks conrespondlng to the known objects. Tha step of comparing the codebook corresponding to the 
given object with the codebooks corresponding to the known objects may comprise computing a mean squared error 
so for each comparison. The method may further comprise the step of detenninlng a minimum mean squared error from 
the computed mean squared errors, and the step of matching the given object to the one of the known objects may 
comprise selecting the known object v/hbh corresponds to a codebook which, when oomparsd to the codebook cor- 
responding to the given object, resulted In the minimum mean squared error. 

[001 2] The step of generating the input vectors comprises performing a linear transform such as a Discrete Cosine 
35 Transfomn on the blocks and generating the Input vectors based on the transfomied blocks. 

[0013] The codebooks corresponding to the known objects may comprise vector quantization codebooks. 

[001 4] The codevectors comprised In the codebooks corresponding to the known objects may have been generated 

from the one or more corresponding training Image signals with use of a k-means clustering technique. 

[00151 The method may furthercomprise the step of Identifying the gWen object as the known object which is matched 
40 to the given object. 

[001 6J The method may further comprise the step of verifying that a claimed identification of the gh^en object com- 
prises the known object which Is matched to the given object 

[0017] A method according to the Inventton of generating a database comprising a plurality of representations of 
known objects, the database for use in matching a given object comprised in an Input Image signal to one of the known 

45 objects, each of the representations of the known objects comprising a codebook corrasponding thereto, comprises 
the sCeps of: for each of the known objects, decomposing Into a plurality of blocks, a portion of each of one or more 
training image signals, the portion of each training Image signal Including the known object; for each of the known 
objects, generating a plurality of training vectors corresponding to the plurality of blocks, eadi training vector based 
on the btock con-espondlng thereto; and for each of the known objects, generating the codebook corresponding thereto 

50 based on one or more of the training vectors generated from the training image signals which Include the known object. 
[001 8] The step of generating the training vectors comprises perfomnlng a linear transform such as a Discrete Cosine 
Transtomi on the blocks and generating the training vectors based on the transformed blocks. 
[0019] Each of the generated codebooks may comprise a vector quantization codebook. 

[0020] Each of the codebooks may be generated from the con-esponding training vectore with use of a k-means 
clustering technique. 

[0021] The given object and each of the known objects may comprise a human face. 

[0022] A system according to the invention comprises means for. carrying out the steps of a method according to the 
tnventton. 
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[0023| The present Invention advantageously employs a signal processing approach which takes advantage of gen- 
eral image statistics* wKhout Incoiporating any ap/vonf Icnowledge of characteristics specific to facial image data. This, 
of course. Is In contrast to the prior art "computer vision" approaches described above which are, for the most part, 
based on heuristics and are, in general, not particularly robust to a wide range of Input Image variations. Speciflcatty, 

5 in accordance with the present Invention, recognition of objects such as faces in Images Is achieved using signal 
compression techniques (/.a, coding techniques) In which a portion of the Image which Includes the object to be rec- 
ognized le.g., the face) Is coded, and the resultant coded data is matched against previously coded training data which 
Is stored in a known object database. By using image compression techniques, each object (e.p., person's face) can 
be advantageously represented by a small set of numbers. 

10 [0024] In accordance with certain Illustrative embodiments of the present Invention, for example, a portion of an Input 
Image which Includes an object {e.g,, a face) Is decomposed Into blocks of pbcel data, a (linear) transform such as, for 
axample. a Discrote Cosine Transf omi (DCT) is performed on the pixel data of each block, and the transformed blocks 
are "Vector quantized" (VQ) to produce a codebook of codavectors for each person. (The block-based transfomi has 
the effect of advantageously compacting the signal energy of the block Into a few significant transform coefficients, 

15 which coefficients can then be advantageously used to represent the block. The transform coefficients representing 
each block of data In the training set may be used to generate a codebook of codevectors.) Generally, the number of 
Input vectors may be much greater than the number of codevectors stored In the codebook - thus, the amount of data 
needed to represent the training data may be advantageously compressed even further. 

[0025] Block-based transforms (Including DOT) and vector quantization (VQ) are both well known techniques com- 
eo monly used in the field of Image coding and fully familiar to those of ordinary skill In the art. Discrete Cosine Transfonns 
are described, for example, in N. Ahmed, T. Natarajanand K. Rao, "Discrete Cosine Transform," IEEE Trans. Comput., 
vol 23, pp. 90-93, 1974 and vector quantization is described, for example, In Y. Unde, A. Buzo and R, M. Gray, "An 
Algorithm for Vector Quantizer Design," IEEE Trans. Commun., COM-28. pp. 84-95, 1990. One liiustrative technique 
whteh may be advantageously used to generate the codebook codevectors from the training data is known as "k-means 
clustering," also familiar to those of ordinary skill in the art and described, for example, In S. P. Lloyd, "Least Squares 
Quantization In PCM," IEEE Trans. Infonn. Theory. IT-28, pp. 127-135, 1982. "Discrete Cosine Transform," "An Algo- 
rithm for Vecto r Quantizer Design" and "Least Squares Quantization In PCM** are each hereby Incorporated by reference 
as If fully sot forth herein, 

[0026] In accordance with certain Illustrative embodiments of the present invention, a given object In an input image 
30 signal is matched to one of a piuratlty of known objects stored In a database, wherein the stored representation of each 
of the known objects comprises a codebook generated based on training image signals comprising the known object. 
A first illustrative embodiment comprises the steps of decomposing the gh/en object Into blocks; performing a plurality 
of encodings of the given object, each encoding comprising coding the object with use of one of the codebooks; de- 
tenninlng a coding en^r for each encoding; and matching the given object to one of the known objects based on the 
35 coding errors. A second Illustrative embodiment comprises the steps of decomposing the given object Into blocks; 
generating a codebook corresponding to theglven object based on the blocks; comparing the codebook corresponding 
to the given object with the codebooks conresponding to each of the known objects; and matching the given object to 
one of the known objects based on the comparison of the codebooks. 

40 Brief Description of the Drawings 

[0027] Rg. 1 shows an overview of a database generation process whereby the data for a known face is constructed 
In accordance with an Illustrative embodiment of the present lnventk>n. 

[0028] Fig. 2 shows an overview off a face recognition process In accordance with an Illustrative embodiment of the 
4s present Invention. 

[0029] Fig. 3 shows a flow diagram of an illustrative codebook generator of the Illustrative database generation proc- 
ess at Fig. 1 . 

[0030] Fig. 4 shows a flow diagram of the classifier of the Illustrative face recognition process of Fig. 2 In accordance 
with a first Illustrative embodiment of the present Invention. 
50 [0031 ] Fig. 5 shows a flow diagram of the classifier of the illustrative face recognition process of Fig. 2 In accordance 
with a second Illustrative embodiment of the present invention. 

Petalled Description 

55 [0032] Rg. 1 shows an oveivtew of a database generation process whereby the data for a known face Is constructed 
in accordance with en Illustrative embodiment of the present lnventk>n. Several different training images, each com- 
prising a different "picture" of a known IndMduars face, may be advantageously provided to the process of F^. 1, 
thereby resulting In the generation of a codebook which will more completely reflect Images of the given known face 
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under different conditions such ae dtrferent tighting. background, facial expression, position, scale, etc. In particular, 
the process of Rg. 1 Is advantageously repeated for each person whose face Is to be Included in the database - thus] 
a different codebook Is generated by the process of Fig. 1 for each different person In the training set. As shown In the 
figure, one or more training images (each containing an Image of the face of the Ith person in the training set) are 

a processed, resulting In codebook 23-1 whksh contains a set of codevectors for that one particular person . 

{0033] Referring to the figure, first (as described above), preprocessor 21 extracts the region of Interest (Aa, the 
face) from the input Image, and nomaallzes the scale thereof. Thus, all further processing can be advantageously 
perfomned on the face (and only on the face), and ail faces presented to the system will be of the same scale {/.e„ 
size). The preprocessing performed by preprocessor 21 is conventional and well known to those skilled tn the art. 

10 [0034] After the preprocessing has been completed, the extracted portion(s) of the Imaga(s) are decomposed Into 
blocks, and a linear, btock-based transfomi is performed by transfomi 22 on each block. The blocks may, for example, 
comprise an area of size 8x8 (/.a, a square of dimension B pixels by 8 pixels), or, alternatively, they may be of size 
1 6x1 6, The blocks may be contiguous across the Image space or, preferably, they maybe overlapping (e.g., by 2 pixels 
In each dimension). By applying a linear transform operator to the original signal, a more efficient representation Is 

' obtained wherein the data Is statistically independent (or at least uncorrelated). Moreover, a linear transfomi may also 
provide substantial energy compaction, a generally desirable property In all signal compression techniques. One com- 
monly used transfonm in the field of Image data compression is the two-dimensional discrete cosine transform (DCT), 
well known to those of ordinary skill in the art. The DCT provides each of the above-mentioned advantages, and is 
currently the building block for all intematlonal compression standards for both still Images and video. 

20 [0035] By perfonming the transformation of transfomi 22 to each of the Image blocks, each of these blocks can be 
^ advantageously represented by Just a few DCT coefficients. In particular, transfonm 22 provides only the (L) coefficients 

with the greatest signal energy content out of the (BxB) coefficients whteh result from the transformation of the (BkB) 
block. Where the block size Is BxB (Le., B=8), for example. L«8 DCT coefftelents may be advantageously provWed; 
where the block size is 1 6x1 6 (/.©., B=16), for example. L«1 6 DCT coeffteients may be advantageously provided. As 

2iJ is well known to those of ordlnaiy skill in the art, the DCT coefficients with the greatest signal energy content typteally 
correspond to the low frequency coeffteients. 

[0036] The (generated and provided) DCT coeffteients for each block in the training data are assembled into a cor- 
responding vector, and the resultant set of vectors are then used by codebook generator 25 to design a VQ codebook. 
This codebook will thereby contain the data which represents the face of the given known Individual In the database. 

30 Note that the number of codebook vectors Is typteally much smaller than the total number of (DCT) vectors from which 
it is generated. For example, more than a thousand Input vectors may be advantageously provided to codebook gen- 
erator 25, and these may be mapped Into a codebook containing less than 100 (e.g., 64) codebook vectors [i.e., 
codevectors). The codebook design performed by codebook generator 25 may, for example, comprise the well known 
"k-means clustering" technique, or other conventtenal technk^ues for designing codebooks from training vector data 

35 as are well known in the field of signal compressten. 

[0037] Fig, 3 shows an illustrative flow diagram of the codebook generation process as may be performed by code- 
book generator 25 of the Illustrative training procedure of Fig. 1 . As explained above, the end result of the process of 
Fig. 1, whteh is advantageously performed for each person whose face is to be represented in the database, is to 
generate a codebook for a given peraon based on one or more training images containing his or her face. Thus, a 

^0 codebook, CP, is to be generated for each person, p = 1 ,2,...,N, where N Is the number of people (/.©.. faces) In the 
database. Each codebook consists of a plurality of codevectors, yj, where 1 = 1 ,2,...,M, and where M is the size of each 
^ codebook (/.©., the number of codevectore contained therein). Illustratively, M may be equal to 64. Each codevector 

consists of L values (i.e., DCT coefficients), as described above, liluslrath/ely, the blocks may be of size 8x8 and L may 
bo 8. 

45 [0038] The Illustrative procedure shown In Fig. 3 comprises an iterative process that generates a codebook by re- 

peatedfy match/ng the (ra/n/ng vectors to a sequence of "Intemiadlate" codebooks, CPq, Cp,,..., Cp^ modifying the 

codebook on each iteration (ie., replacing codebook Cp^ with Improved codebook Cp„„.,>. until a temilnating criterion 
Is met. The codebook whteh results from the final iteration then advantageously becomes the codebook which is used 
to represent the particular Individual's face In the database. 

50 [00391 Speclfteally. the lllustratlvo procedure of Rg. 3 begins with step 31 which Initializes the codebook to be used 
in the first iteration, CPq. to a random set of codevectora. These randomly chosen codevectors may, for example, 
comprise the first M vectors from the training set, or, alternatively, they may be chosen by generating random values 
therefor. Then, in each Iteration, m, step 32 uses the (entire) set of training vectors to fomi "cluster sets," familiar to 
those skilled In the art. In partteular, each training vector, where J » 1,2,...,J. is matched against each codevector 

« yP,, where 1 « 1 .2.....f^, In codebook Cp„. computing an error functton, d()^.yP,), therefor. As a result of this matching, 
each training vector, Xj, may be assigned to a cluster set, R|. essodated with a corresponding one of the codevectora 
yi, such that 
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R, = {X : d(x,/,) < d(x,/,5) tor aD k 9* I } 

That Is, each duster set, R|, consists of all the training vectors, x, for which the error (/.e. . the difference) between the 
training vector X and the rthcodevector, yPj. Is less than the error between the training vector x and any other codevector 
in the codebook. Put another way. each training vector Is assigned to the duster associated with the codevector in the 
current codebook to whteh It ts "closest," as determined by the error function, d(x,y). The error function d(x,y) may be 
any appropriate error metric. For example. d(x,y) may be the conventional mean square error (mse) metric, familiar to 
those of ordinary skill in the art. 

[0040] In step 33. a new codebook, CPni^-i • developed for use in the next iteration based on the duster sets which 
resulted from the analysis of step 32. In particular, for each cluster set, R,, the centroid of the set {I.e., an n-dimensionai 
weighted mean of the individual vectors in the set) Is computed. Then the new codebook (for the next Iteration) is 

constructed so as to comprise the set of computed centrokto. cent(R|), for each I » 1 .2 M. Note that if the mean 

squared error Is used as the envr metric, the centrold reduces to the arithmetic average. That is. 



for R| = x^, where j = 1 ,2,...J|Ril|, and where ORJj is the cardinality {i.e., the nuniber of elements) of the set Ri. 

[0041 ] In step 34, a total distortion measure for the given iteration, D^, is calculated based on the distance between 

each of the training vectors and the centroM to which It dusters. That Is, 




D„ = i:d(x,.cent(R^)), 

where the summation is perfonmed over all j « 1,2,..., J (/.e., over all training vectors). In this manner, D„ comprises a 
30 measure of accuracy with which codebook Cp„ Is able to code the given set of training vectors. Finally, decision 35 
computes the difference between the computed distortion measure (D,^ and the distortion measure computed for the 
previous Iteration (D„.^), in order to measure the degree of Improvement in the quality of the codebooks from the 
previous Iteration to the current one. if this difference Is less than a predetermined threshold. Tq, the iteration may be 
advantageously temiinated. since the improvement In codebook quality which Is likely to be gained by perfonning 
35 additional iterations Is small. 

[0042] Once a database of codebooks representing known faces has been generated, face recognition may be 
perfomfied In accordance with the prindples of the present invention based on the set of codebooks contained In the 
database. Fig. 2 shows an overview of af^ce recognition process in accordance with an fltustrative embodiment of the 
present Invention. Given a database of codebooks {e.g., codebooks 23-1, 23-2, .... 23-N), as generated, for example, 
AO with use of the iliustrative procedures o1 Figs. 1 and 3. the illustrative embodiment as shown in Fig. 2 may be used to 
identify (or to verify the Identily of) a person's face comprised in an input image signal. 

[0043] Rrst, in an identteat manner to that of the illustrative codebook generation process of Fig. 1 , preprocessor 21 
extracts the region of interest (/.e.. the face) from the Input Image, and nomriallzes the scale thereof. Thus, all further 
processing can be advantageously performed on the face (and only the face), and all faces presented to the system 
^5 will be of the same scale (/.a, size). Again, the preprocessing performed by preprocessor 21 is conventtonal and well 
known to those skilled In the cut. 

tQQ441 After the preproceaalng has been completed, and again In an Wentlcal manner to that of the illustrative code- 
book generation process of Fig. 1 , the extracted portion of the input Image Is decomposed Into blocks, and a linear, 
block-based transform Is performed on each btock by transform 22. Again, the blocks may, for exEunple, be of size 6x8, 
50 or, alternatively, of size 16x1 6, and they may be contiguous across the Image space or, preferably, overlapping. The 
same transform (such as a DOT) as was used In the generation of codebooks 23-1 to 23-N is advantageously used in 
the face recognition process of Fig. 2 as well. Moreover, the same block size Is advantageously used and the same 
sat of (L) coefficients is advantageously provided. 

[0045] Once the OCT Input vectors have been provided by transfomi 22, classifier 24 Is used to identify (or In the 
$5 case of face verification, to verify) the face contained in the Input image based on (known) face codet>ooks 23-1 to 
23-N. Specifically, dassifier 24 may. for exampte.'^determine which of the codebooks provides the best match to the 
input vectors whteh have been derived from the Input Image. In a face Identification context, the face associated with 
the "t^est matching" codebook may then be identified as the most likely candidate. However, if the error metric upon 
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which the comparisons are based (see, B.g., the discussion below) does not provide an "adequately" small total error 
value {Q.g., below a predetermined threshold) for any of the known face codebooks (thereby Indicating that no "suffi- 
ciently good" match was found), the system may not Identify any of the known faces as the face In the Input Imago. 
Rather, the system may singly detennine that the person whose face is In the Image Is, In fact, none of the people for 

s whom face data has been previously collected and stored In the database. 

[0046] In a face verification context, the system may verify or reject (/.e. , tall to verify) the claimed identity based on 
whether the claimed Identity matches the known face codebook which Is determined to be the best match with the face 
In the Input Image. Alternatively, the system nrmy verify or reject the claimed Identity based on whether the total error 
metric obtained when the Input Image is analyzed with respect to the partteular codebook associated with the face of 

10 the claimed identity Is "adequately" small, thereby Indicating a sufficiently ''good" match between the face In the Input 
image and the face of the claimed Identity. 

[0047] in accordance with certain embodiments of the present Invention, a detenmination of which known face code- 
book provides the best nnatch to the face In the Input Image can be made in a number of different ways. For example, 
two Illustrative embodiments of the operation of classifier 24 of the Illustrative face recognition process of Fig. 2 are 

Hi shown In Figs. 4 and 5, respectively. 

[0048] Rg. 4 shows a f tow diagram of the operation of classifier 24 of the Illustrative face recognition process of Fig. 
2 In accordance with a first lilustrath^e embodiment of the present Invention, in accordance with this embodiment, each 
of the known face codebooks are used to encode the Input image vectors, and a total coding error is determined for 
each such encoding. Then, the encoding which results In the lowest total coding error Is determined, and the face 

^> corresponding to the codebook used In that encoding Is Identified as the best match to the face tn the Input Image. 
[0049] Spectftealty, with reference to Fig. 4, the illustrative process begins with step 41 , whtoh perfonns a plurality 
of conventtonal VQ codebook encodings of the Input Image vectors, each such encoding performed with use of a 
different one of the known face codebooks (/.e.. codebooks 23-1 through 23-N). In parttoular, and as is well known to 
those of ordinary skill In the art. each of these encodings consists of selecting one of the codevectors from the given 
codebook for each of the Input Image vectors, wherein the selection Is made based on an error metric, such as, for 
example, a mean squared error. The selected codevector Is the one that provides a "best match* to the input vector - 
that Is, a codevector Is selected for each Input vector so as to advantageously minimize the en^or (/.©.. the difference) 
between the Input vector and the coding thereof (/.e.. the selected codevector). Thus, for each codebook, CP (where 
p ==1 ,2 N). containing codevectors, yP| (where I = 1 .2 M), each Input vector, Xj (where J 1 ,2,..., J), is compared to 

30 each of the codevectors to detemilne the one which results In a minimum error, given an error function d(x,y). In 
particular, dP„|n(xj), representing the minimum errorfor a coding of Input vector Xj with useof codebook CP Is determined 
for each input vector. Xj, as follows: 

35 d^m^ix^) = {d(Xj.yP,) : d(Xj./,) < dlx^.y^^ Tor afik^S, where I. k « 1 ,2....,M) 

[0050] After each of the encodings have been perf omned, step 44 of the illustrative procedure of Rg. 4 then caknjlates 

a total error, DP, for each of the encodings (/.a, for ali p « 1,2 N). These total en-ors are merely the sum of the 

individual coding en'ors, computed as foltows: 

40 

D^«xrf^^(xp 

where the summation is perfonned over alt J « 1 ,2,...,J, and where the indh^ldual values of dP^nC^j) determined In 
45 the coding of each input vector, X|, with use of the corresponding codebook, CP, as shown above. Thus, these total 
error vafues represent the overall quality (f,e., accuracy) of the encodings porfonnod with each of the codebooks. (Note, 
of course, that these total error values may alternatively and advantageously be computed during and as part of the 
encoding process itself, obviating the rteed for step 44 to t)e performed as a separate and distinct operation from the 
encodings performed In step 41 .) 
so [0051] Completing the illustrative preoadure of Fig. 4. step 46 datenmlnes the minimum total en^or value obtained 
over ali of the encodings (Lb., the value of p for whteh DP te minimum), and step 46 selects the known face which is 
represented by ^e codebook whteh was used In the encoding whtoh achieved the minimum total enror (iLe.. the best 
encoding). In Uila nnanner, the given face from the input Image may be advantageously identified as being the known 
face In the database whose associated codebook results In this "besT encoding ol the given face. 
55 [0052] Fig. 5 shows a flow diagram of the operation of classifier 24 of the Illustrative face recognition process of Fig. 
2 in accordance with a second IKustratlve embodiment of tiie present Invention. This illustrative embodiment provides 
a computationally simpler technk^ue whteh results In only sttghtiy Ipwer perfomriance than that of the illustrative proce- 
dure of Fig. 4. In particular, this second'tllustrative embodiment generates a codebook for the input Image, and then 
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compares this codebook to each of the known face codebooks In order to determine the "best match." Again, as In the 
illustrative procedure of Rg. 4. the lowest totat error value is deemed to be Indicailvo of the best noatch. Howaver. the 
difference between the Illustrative procedures of Figs. 4 and 5 Is that In the procedure of Rg. 5, rather than coding 
each of the (J> Input vectors (which may, for examplo. require the coding of thousands of vectors), step 42 produces 
5 a codebook, illustratively containing only M (e.p., 64) codevectors. therefrom. Then, this more limited set of vectors (/. 
e the M codevectors rather than the J Input vectors, where, typically. J » f^) are encoded with use of each of the N 
codebooks (step 43), and the total errors obtained from each of these (simpler) encodings are used (In steps 44-48) 
to select the known face represented by the codebook having the minimum total error 

[00531 specifically, the codebook generation of step 42 may comprise essenUally the same procedure as that illus- 
10 tratlvely used for generating the known face codebooks as shown In Rg. 3. In this case, however, the Illustrative training 

procedure of Rg. 3 Is applied to the Input Image data, rather than to training image data. (Note that there is typically 

only one image from which the Input Image vectors are obtained, rather than the potentially many images from which 

training Image vectors may be advantageously obtained.) The advantage of the use of the illustrative procedure of Fig. 

5 instead of the Illustrative procedure of Fig. 4. therefore, is that of having to compare only M (input Image derived) 
15 codevectors to M (training image derived) codevectors for each of N codebooks, rather than having to compare J (Input 

image derived) vectors to M (training Image derived) codevectors for each of the N codebooks. Since, typically. J » 
Ihis may advantageously result In significant computational savings. Again, the error metric used may be a mean 

squared error, or It may be any other error metric famUiar to those of ordinary skill In the art. 

[00541 Although a number of specific embodiments of this Invention have been shown and described herein, it Is to 
2c be understood that these embodiments are merely Illustrative of the many possible specifte arrangements which can 
be devised in application of the principles of the Invention. For example, although the description above has been 
focused on the recognillon of human faces, it will be obvious to those skilled In the art that the techniques of the present 
invention may be applied to any general image recognition problem, wherein a given object Is matched to one of a 
plurality of known objects, representations of which are stored In a database. In addition, although the embodiments 
^ of the present invention described above employ a mean squared error metric for measuring the difference between, 
for example Input vectors and codevectors, allematlve embodiments may use other error metrics, includmg. for ex- 
ample perceptual metrics which Incorporate perceptual criterion. Such perceptual metrics are familiar to those skilled 
In the 'art. Numerous and varied other arrangements can also be devised In accordance with the principles of the 
present Invention by those of ordinary skill in the art without departing from the scope of the invention. 

3V 

Claims 

1 A method of matching a given object comprised In an input image signal to one of a plurality of known objects, a 
3f: ' representation of each of the known objects having been stored In a database and comprising a codebook corre- 
sponding thereto, each codebook comprising a plurality of codevectors having been generated based on one or 
more corresponding training Image signals comprisfrig the conrespondlng known object, the method comprising 
the steps of: 

40 decomposing a portion of the input image signal which incfudas the given object Into a plurality of blocks; 

generating (22) a plurality of input vectors con-espondingto the plurality of blocks, each Input vector based on 
the block corresponding thereto; and 

4' matching (24) the given object to one of the known objects based on one or mora of the Input vectors and on 

one or more of the codevectors conH>ris«* one or more of the codebooks (23-1 23-N) corresponding to 

the known obiecta. 

2. The method of claim 1 wherein the step of niatching the given object to the one of the known objects comprises: 

performing a plurality ol encodings of the at least a portion of the input imago signal, each encoding comprising 
coding one or more of the Input vectors based on one or more of the codevectors comprised in a corresponding 
one of the codebooks; 

5:i determining a plurality of coding errors corresponding to the plurality of encodings; and 

matching the given object to the one of the known objects baaed on the plurality of coding enrors. 
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3. The method of claim 1 whoreln the stop of matching the given object to the one of the known objects comprises: 

generating a codebook con-espondlng to the gh^en object based on one or more ot the Input vectors; 
comparing the codebook corresponding to the given object with one or more of the codebooks corresponding 
.$ to the known objects; and 

matching the given object to the one of the known objects based on the comparison of the codebook corre- 
sponding to the ghren object with the codebooks corresponding to the known objects. 

4. A method of generating a database comprising a plurality of representations of known objects, the database for 
^o use in matching a given object comprised In an Input Image signal to one of the known objecta, each of the rep- 

resentattons of the known objects comprising a codebook <23-l) corresponding thereto, the method comprising the 
steps of: 

for each of the known objects, decomposing Into a piuraOty of btocks, a portion of each of one or more training 
v5 imago signals, the piortlon of each training Image signal Including the known object; 

tor each of the known objects, generating (22) a plurality of training vectors corresponding to the plurality of 
blocks, each training vector based on the block conresponding thereto; and 

for each of the known objects, generaUng (25) the codebook (2a-l) corresponding thereto based on one or 
more ot the training vectore generated from the training Image si£^als which Include the known object. 

no 

5. The method ot claim 4 wherein the step of generating the training vectors comprises perfomiing a linear transfomn 
on the blocks and generating the training vectore based on the transfonmed blocks. 

6. A system for matching a given object comprised In an Input Image signal to one of a plurality of known ot^ects, a 
lis representation of each of the known objects having been stored In a database and comprising a codebook conre- 
sponding thereto, each codebook comprising a plurality of codevedors having been generated based on one or 
more corresponding training image signals comprising the corresponding known object, the system comprising: 

means for decomposing a portion of the input image signal which Includes the given object into a plurality of 
so blocks; 

means for generating (22) a plurality of input vectors con-espondlng to the plurality of blocks, each Input vector 
based on the block corresponding thereto; and 

means tor matching (24) the gWen object to one ot the known objects based on one or more of the Input vectors 
an d on one or mo re ot the codevectore comprised In one or more of the codebooks (23- 1 , . .. ,23-N) correspond- 
:is ing to the known objects. 

7. The system of claim 6 wherein the moans for matching the given object to the one of the known objects comprises : 

means for performing a plurality of encodings of the at least a portion of the Input Image signal, each encoding 
K) comprising coding one or more of the Input vectors based on one or more of the codevectore comprised In a 

corresponding one of the codebooks; 

means for determining a plurality of coding errore corresponding to the plurality of encodings; and 

means for matching the given object to the one of the known objects based on the plurality of coding errors. 

'15 8. The system of claim 7 wherein the means for matching the gh^en object to the one of the known objects comprises: 

moans for generating a codebook corresponding to the given object based on one or more of the Input vectors; 
a comparator which compares the codebook corresponding to the given object with one or more of the code- 
books conresponding to the known objects; and 

means for matching the given object to the one of the known objects based on the comparison of the codebook 
corrasponcfing to the given object with the codebooks corresponding to the known objects. 

9. A system for generating a database comprising a plurality of representattons of known objects, the database for 
use in matching a given object comprised In an Input Image signal to one of the known objects, each of the rep- 
i(5 resentatlons of the known objects comprising a codebook (23-1) conresponding thereto, the system comprising: 

means, applied for each of the known objects, for decomposing Into a plurality of blocks, a portion of each of 
one or more training image signals, the portion of each training image signal Including the known object; 



9 



04/26/2005 15:24 FAX 7132388008 



CONLEY.ROSE 



1^02 



EP 0 806 739 B1 



means (22). applied for each of the known objects, for generating a plurality of training vectors connespondlng 
to the plurality of blocks, each training vector based on the block corresponding thereto; and 
means (25) applied for each of the known objects, lor generating the codebook (23-1) corresponding thereto 
based on one or more of the training vectors generated from the training Image signals which Include the 



known object. 



10 The system of cialm 9 wherein the means for generating the training vectors comprises means tor performing a 
* linear transfomi on the btocks and means for generating the training vectora based on the transformed blocks. • 



PatentansprUche 

1 Verfahren zum Anpassen elnes In einem EingangBblldslgnal enthaltenen gegebenen Objekts an eines von meh- 
reren bekannten Objeklen. wobol eine DarsteHung jedes dor bekannten Objekte In einer Datonbank gespeichert 
v worden ist und ein ihm enteprBchendes Codebuch enthfilt, wobei jedes Codebuch mehrere Codevektoren enthalt. 

dleaufderGrundlageelnesodermehrererentsprechenderTralnlngsblld-Slgnale.diedasentsprechendebekannte 

Objekt enthalten, erzeugt worden sind. wobei das Verfahren die folgenden Schrltte umfaf3t: 

Zerlegen eines das gagebene Objekl enthaltenden Abschnitts des Eingangsbildslgnals In mehrere Biacke; 
20 Erzougon (22) mehrerer EIngangsvektoren, die den mehreren Bidcken entsprechen. wobei jeder Elngangs- 

vektor auf denn ihm ontsprechenden Block baslert; und 

Anpassen <24) des gegebenen Objekts an elnes der bekannten Objekte auf dar Grundlage elnes oder meh- 
rerer der Eingangsvekioren und elnes oder mehrerer der Codevektoren. die In eInem oder mehreren der den 
bekannten Objekten ontsprechenden CodebOcher (23-1 23-N) enthailen sind. 

2. VerfahrennachAnspruchl.beidemderSchrtttdesAnpassensdesgegebenenObJekteandaselnederbekannten 
Objekte folgendes umfaBt: 

Ausfuhren mehrerer Codlerungen des mindestens einen AbschnHts des EIngangabttdslgnals. wobei jede Co- 
30 dierung das Codleren eines oder mehrerer der EIngangsvektoren auf der Gmndtege eines oder mehrerer der 

In elnem ontsprechenden der Codebtteher enthaltenen Codevektoren umfaBt; 
Bestimmen mehrerer Codlertehler, die den mehreren Codienjngen entsprechen; und 
Anpassen des gegebenen Objekts an das eIne der bekannten Objekte auf der Grundlage der mehreren Co- 
dierlehler. 

3. Verfahren nach Anspruch 1 . bel dem derSchritt des Anpassens des gegebenen Objekts an das eIne der bekannten 
Objekte folgendes umfaSt: 

Erzeugen elnes dem gegebenen Objekt ontsprechenden Codebuchs auf der Grundlage eines oder mehrerer 

41} der EIngangsvektoren: 

Verglelchen des dem gegebenen Objekt ontsprechenden Codebuchs mit elnem oder mehreren der den be- 
kannten Objekten entsprechenden Codebucher; und 

Anpassen des gegebenen Objekts an das eine der bekannten Objekte auf der Grundlage des Vergleichs des 
dem gegebenen Objekt entsprechenden Codebuchs mit den den bekannten Objekten entsprechenden Co- 
4:'} debOchern. 

4 Vettahren zum Erreugea elner Datonbank, die mehrere Darstellungen bekanntor Objekte enthalt, wobei die Da- 
' tenbank fOr die Anpasaung elnes In elnem Elngangsbildstgnal enthaltenen gegebenen Objekts an elnes der be- 
kannten Objekte venwendet wird. wobei jede der Darstellungen der bekannten Objekte eIn ihm entsprechendes 
5.) Codebuch (23-1) enthfllt, wobei das Verfahren die folgenden Schrltte umfarJt: 

Zerlegen eInesAbschnittsJedes von elnem oder mehreren Tralnlngsblld-Slgnalen In mehrere Bl6ckefQr jedes 
der bekannten Objekte, wobei dar AbschnItt jedes Tralnlngsbl Id-Signals das bokannte Objekt enthalt; 
Erzeugen (22) mehrerer Tralnlngsvektoren, die den mehreren Blacken entsprechen. fur jedes der bekannten 
55 Oblekte wobei jeder Tralnlngsvektor auf dem Ihm entsprechenden Block baslert; und 

Erzeuqen (26) des elnem bekannten Objfekt entsprechenden Codebuchs (23-1) auf der Gmndlage elnes Oder 
mehrerer der Tralnlngsvektoren. die aus den das bekannte Objekl enthaltenden Trainingsblld-Slgnalen er- 
zeugt werden, fOr jedes der bekannten Objekte. 
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5 Verfahron nach Anspruch 4, bel dom darSchrlttdes Erzeugens derTralnlngsvektorendas AuslQhren einerllnearen 
Transformation an den BIScken und das Erzeugen der Tralningsvoktoren auf der Grundlago der transfonmlerten 
Bldcke umfaBt. 

J 6 Svstem zum Anpassen elnes gegebenen Objokts. das In einem EIngangsblldslgnal emhalien 1st. an eines von 
mehreren bekannten Objekten. wcbel elne Darstellung Jedes dor bekannten Objekle in einer Datenbank gospel- 
chert wordon ist und eln Ihm enteprechendos Codebuch enthdlt. wobol jodos Codebuch mehrere Codevoktoren 
enth&lt. die aut dor Qrundlago elnes odor mehreror entsprechender TratnlngsbHd-Slgnalo, die das entsprechendo 
bokannto Objekt onthallen, erzeugt worden sind. wobol das System folgendos umfaBt: 

Mittel zum Zertegon olnos das gegobone Objokt onthattendon Abschnltts dos Eingangsblldslgnate In mehrere 

Bldcko' 

Mittel zum Erzeugen (22) mehreror Elngangsvoktoren, die don mohreren Bldckon entsprochen, wobol iodor 
EIngangsvektor aut dem Ihm entsprochenden Block baslort; und 
fs Mittel zum Anpaasen (24) des gegebenen Objekts an elnes der bekannten Objekte auf der Grundlage elnes 
Oder mehrerer der Elngangsvoktoren und elnes odor mehreror der CodoveWoron, die In elnem odor mohreren 
dor den bekannten Objekten entsprochenden CodebOchor (23-1 23-N) onthalten sInd. 

7. System nach Anspruch 6, bel dem die Mittel zum Anpassen des gegebenen Objekls an das elne der bekannten 
1*0 Objekte folgendes umfassen: 

Mittel zum AusfOhren mohrererCodiorungen des mindestens einon Abschnltts des Elngangsblldslgnals. wobol 
jede Codiorung das Codieren elnes odor mehrerer der Elngangsvoktoren auf dor Grundlage elnes oder meh- 
rerer der In elnem entsprechenden dor CodebQcher enthaltenen Codevoktoren umfaSt; 
•»5 Mittel zum Bestimmen mehrerer CodiOffehler, die den mohreren Codlerungen entsprechen; und 

Mittel zum Anpassen des gegebenen Objekte an das elne der bekannten Objekle auf dor Grundlage der 
mohreren Codlerfehler. 

a. System nach Anspruch 7. bel dem die Mittel zum Anpassen des gegebenen Objekts an das elne der bekannten 
iv Ob(ekte fofgendes umfassen: 

Minel zum Erzeugen elnes dem gegebenen Objokt entsprochenden Codebuchs auf der Grundlage elnes oder 
mehrerer dor Bngangsvektoron; ^ ^ . ^ u 

einon Komparator. dor das dem gegebenen Objekt entsprechendo Codebuch mlt einem odor mohreren der 
:t5 den bekannten Objekten entsprochenden CodebUcher vergletoht; und 

Mittel zum Anpassen des gegebenen Objekts an das elne der bekannten Objekte aut der Grundlage des 
Verglelchs des dem gegebenen Objokt entsprochenden Codebuchs mlt den den bekannten Objekten entspre- 
chenden CodebQchem. 

System zum Erzeugen einer Datenbank, die mehrere DarBiellungon bekanntor Objekte enthfllt. wobei die Daten- 
bank (Dr die Anpassung elnes In elnem EIngangsblldslgnal enthaltenen gegebenen Objekts an elnes der bekannten 
Objekte vomvendel wird. wobei Jade dor Darstellungen der bekannten Objekte eln Ihm entsprechendes Codebuch 
(23-1) enthait. wobei das System folgendes umfaBt: 

MIttoi, die aul jedes der bekannten Objekte angowendet wordon, zum Zertegen eines Abschnltts Jedes von 
einem odermehreren Tralningsbild-Slgnalen In mohrero BIflcke. wobe* der Abschnltt Jedes TralnlngsblW-SJ- 
gnals das bokannto Objekt enthfllt; , , 

Mtttel (22) . die auf Jedes der bekannten Objekte angowendet werden, zum Erzeugen mehrerer Trainlngsvek- 
toren . die den mohreren Blficken entsprechen. wobei jederXralnlngsvektor auf dem Ihm entsprechenden Block 

basiert; und . . 

Mittel (25), die auf jedes der bekannten Objekte angowendet werden, zum Erzeugen des ihm entsprechendsn 
Codebuchs (23-1) auf der Grundlage elnes odor mehreror der Tralningsvoktoren, die aus den das bekannte 
Objekt enthaltenden TralnlngsblW-Slgnalen erzeugt werden. 

10 System nach Anspruch 9, bel dom die Mltte! zum Erzeugen der Tralningsvoktoren Mittel zum AusfOhren einer 
* llnearen Transfoimation an don BI5cken. und Mittel zum Erzeugen der Trainlngsvektoron auf der Grundlage der 
transfomriierten Bldcko umfassen. ^ 
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Revendlcatlons 

1. Procdcfd do mlse on correspondanco d'un objet donn6 compris dans un signal dimage d'entr6e avec I'un d une 
pluralitd d'objots connus, une ropr68entatlon do chacun dos objote connus ayant 616 mdmoris^e dans une base 

5 do donndes ot conpronant un dicttonnairo do codos lui corrospondant, chaquo dictlonnaire de codes comprenanl 

uno pluralltO do vec^ours do codo ayant 6t6 gdndrds on fonctlon d'un ou d© plusloure signaux d image tfappron- 
tlssago corrospondante compronant I'objot connu corrospondant . le proc6d6 comprenant les 6tapes de 

dOcomposilion d'uno partio du signal d'lmago d'ontrdo qui comport© I'objat donno en une pluralito de blocs ; 
fo gOnfiration (22) d'uno pluralitd do voclours d'onlrAo corrospondant A la plurality de blocs, chaque vectour 

d'entr6o 6tant bas6 sur lo bloc lui corrospondant ; et 

miso on correspondanco (24) de i'objot donnd avec I'un des objots connus on fonctlon d'un ou do plusieurs 
des vocteufB d'ontr6o ot d'un ou do plusieurs dos voctours do codo compris dans un ou plusieurs des dlctlon- 
nalres da codos (23-1 23-N) correspondent aux objots connus. 

2. ProcOdd selon lo rovondtoatlon 1 , dans lequel l'6tap© do mlse on conrospondanco de robjet donnO avec I'un des 
objots connus comprond : 

l'ex6cution d'une pluralft6 d'encodages de la au molns une partio du signal d'Imago d'entr6o. chaquo oncodage 
20 comprenant un codago tf un ou do plusieurs des vecteurs tfontrO© onfonctlon tfun ou de plusieurs dos voctours 

de code compris dans un dictlonnaire correspondent des dictlonnalres de codes ; 
la determination d'uno plurality tforreurs de codago corrospondant & la plurallt6 d'oncodagos ; et 
)a mlse en correspondanco do robjot donnd d I'un dos objots connus en fonctlon do la pluralltO d'enours do 
codago. 

3. ProcOdO solon fa rovendlcatlon 1 . dans lequel I'Otapo do ntlso en correspondanco do I'objot donnd avec I'un dos 
objots connus comprend : 

la g6n6ranon d'un dictlonnaire de codes correspondent 0 I'objot donnd on foncUon d'un ou do plusieurs des 
vecteurs d'entr6o ; 

la comparalson du dictlonnaire do codes correspondent ft I'objot donnO avec un ou plusieurs des dictlonnalres 
de codes correspondent aux objets connus ; et 

la mlse en correspondance de I'objet donnO avec I'un des objets connus en fonctlon do la comparalson du 
diclionnairo de codos corrospondant & i'objet donnO avec les dictlonnalres de codes corrospondant aux objels 
connus. 

4. ProcOdO de gOndratlon d'une base do donn668 comprenant une plurallt6 do representations d'objels connus, la 
base de donn6es destln6e & fttro ullllsde dans la mlse en conrospondanco d'un obJet donnO compris dans un signal 
d'Imago d*ontrOo avec I'un des objots connus, chacuno dos representations des objets connus comprenant un 
dictlonnaire de codos (23-i) lui con-ospondant. le proc6d6 comprenant les fitapes do : 

pour chacun dos objots connus. decomposition en une plurallte de blocs d'uno partio de chacun d'un ou de 
plusieurs signaux d'image d'apprentlssago, ta partle de chaquo signal d'Imago d'apprentissage comportanl 
robjot connu ; 

pour chacun des objots connus, gOneraUon (22) d'uno piuralite de vecteurs d'apprentissage correspondent 0 
la plurallte de blocs, chaquo vectour d'apprentissage Otant base sur le bloc lui correspondant ; et 
pour cUacu^^ dos objots ooixnuB. gdneTatlon (25) du dtattonnairo do codos (23-1) lui con-espondant en tonctton 
d'un ou de plusieurs des voctours d'approntlssago gOnOres ft partir dos signaux d'image d'apprentissage qui 
comportent Pobjot connu. 

5 Precede solon la revendication 4, dans lequel l'6tapo de generation dos voctours d'apprentissage comprend I'oxO- 
cutlon d'uno transformee llneairo sur les blocs el la generation des voctours d'approntlssago en fonction des blocs 

transformds. 

6. Systeme de mlse en correspondance d'un objet donnO compris dans un signal d'Imago d'entreo avec Tun d'une 
plurallte objets connus, une ropresontaUon rfa chacun des objets connus ayant etO memorisOe dans uno base 
de donnOes et comprenant un dictlonnaire de codes lui correspondant. chaquo dictlonnaire do codes comprenant 
une plurallte do voctours de codo ayant ete gOndres on fonctlon d'un ou do plusieurs signaux d'image d'appren- 



©027 



EP0806 739B1 

tissage correspondants compranant robjet connu correspondant, le aystftme comprenant : 

un moyen pour dteomposer une partie du signal d'image d'entr6e qui cornporte fobjet donnd en una plurality 
de blocs ; 

5 un moyan pour g6n6rer (22) une plurality de vecteurs d'entr^e conrespondant h la pluratltd de blocs, chaque 

vecteur d'entr^e dtant basd sur le bfoc lul confespondant ; et 

un moyen pour adapter (24) robjel donnd h I'un dee objets connus en f onctlon d*un ou de plusleurs des vecteurs 
d'entrde et tfun ou de plusleurs des vecteurs decode comprts ctos un ou plusleurs des dictlonnalres da codes 
(23-1 23-N) correspondant aux objets connus. 

10 

7. Systdme selon la revendlcatlon 6, dans lequel le moyen de mise en correspondance de I'objet donn6 avec I'un 
des objsts connus comprend : 

un moyen pour exdcuter une plurality d'enoodages de la au molns une partie du signal d'Image d'entrde. 
15 chaque encodage comprenant un codage d'un ou de plusleurs dss vecteurs d'entr6e en fonctlon d'un ou de 

plusleurs des vecteurs de code comprls dems un dlctlonnalre correspondent des dictlonnalres de codes ; 
un moyen pour determiner une plurality d'erreurs de codage correspondant 6 la plurallt6 d*encodages ; et 
un moyen pour adapter Pobjet donnd & I'un des objets connus en fonctlon de la pluratltd d*erreurs de codage. 

^0 8. Systdme selon la revendlcatlon 7, dans lequel le moyen de mIse en correspondance de Tobjet donn6 avec Tun 

des objets connus comprend : 

un moyen pourg6n4rer un dictlonnalre decodes correspondant k Tobjet donnd en fonctlon d'un ou de plusleurs 
des vecteurs d'entr^e ; 

?5 un comparateur qui compare \e dlctlonnalre de codes correspondant k I'objet donnd avec un ou plusleurs des 

dictlonnalres de codes correspondant aux objets connus ; et 

un moyen pour mettre en correspondance I'objet donnd avec Tun des objets connus en fonctlon de la com- 
paralson du dlctlonnalre de codes correspondant 6 Tobjet donndavec les dictlonnalres de codes correspondant 
aux objets connus. 

.70 

9. Syst&me de generation d'une base de donn^es comprenant une plur€dlt6 de representations d'objets connus» la 
base de donndes destlnde h dtre utitlsee dans la mIse en correspondance d'un objet donnd comprls dans un signal 
dimage d*entree avec Vun des objets connus, chacune des representations des objets connus comprenant un 
dlctlonnalre de codes (23-1) lul correspondant, lesystdme comprenant : 

35 

un moyen, appllqud pour chacun des objets connus, pour decomposer en une pluraltte de blocs une partie 
de chacun d'un ou de plusleurs signaux d'Image d'apprentlssage, la partie da chaque signal d'image d*ap- 
prentlssage comportant Tobjet connu ; 

un moyen (22). applique pour chacun des objets connus, pour generer une plurallte de vecteurs d'apprentis* 
40 sage correspondant k la plurallte de blocs, chaque vecteur d'apprentlssage etant base sur le bloc lul 

correspondant ; et 

un moyen (25), applique pour chacun des objets connus, pour generer le dlctlonnalre de codes (23-1) lul cor- 
respondant en fonctlon d*un ou de plusleurs des vecteurs d'apprentlssage generes k partir des signaux d'Image 
d'apprentlssage qui comportent Pobjet connu. 

45 

10, Systbme se}on la r&vendlcatton 9, dans lequel le moyen de generation des vecteurs d'apprentlssage comprend 
un moyen pour executer une transformee ttneaire sur les blocs et un moyen pour generer les vecteurs d'appren- 
tlssage en fonctlon des blocs transformes, 

50 
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